Priming the cache #168

Merged
merged 1 commit into main from prime on Feb 23, 2024

Conversation

@tgwizard (Contributor) commented Feb 22, 2024

Sometimes data that has already been loaded through other means will later be requested again by an independent batch data loader. It can be convenient to prime that data loader's cache with the already loaded data, removing redundant calls.

The use-case I have can be summarized by this query:

query {
  shopsFollowed(first: 10) {
    nodes {
      id
      followedByMe
    }
  }
}

The shopsFollowed resolver performs a query to load 10 shops that the current user follows. This is returned as the Shop type, which is used extensively in the API. The Shop.followedByMe field returns a boolean of whether the current user follows the shop. It's backed by a custom batch data loader.

For the specific shopsFollowed use-case, we know that all the shops returned are followed by the current user, so there's no need for the custom batch data loader to load this again.

With prime we can have the shopsFollowed resolver pre-populate this information on the loader.
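As a sketch of that pattern, the following plain-Ruby stand-in shows a resolver-side priming flow. The class and method names (`FollowedByMeLoader`, `batch_fetch`) are illustrative, not the actual graphql-batch or application classes:

```ruby
# Hypothetical sketch of the shopsFollowed pattern: the resolver already
# knows every returned shop is followed, so it primes the loader's cache
# instead of letting the batch loader re-query the same information.
class FollowedByMeLoader
  def initialize
    @cache = {}
  end

  # Insert a value only if the key is not already cached (prime semantics).
  def prime(key, value)
    @cache[key] = value unless @cache.key?(key)
  end

  # Return the cached value, or fall back to a (stubbed) batch fetch.
  def load(key)
    @cache.fetch(key) { @cache[key] = batch_fetch(key) }
  end

  private

  def batch_fetch(_key)
    # In the real loader this would query the database in a batch.
    false
  end
end

loader = FollowedByMeLoader.new
followed_shop_ids = [1, 2, 3] # ids returned by the shopsFollowed query
followed_shop_ids.each { |id| loader.prime(id, true) }

puts loader.load(1)  # true, served from the primed cache
puts loader.load(99) # false, falls through to batch_fetch
```

The real loader resolves promises rather than plain values, but the cache-or-fallback shape is the same.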

This is also available in other GraphQL batch data loader libraries.

@tgwizard tgwizard requested review from swalkinshaw and a team February 22, 2024 13:28
@swalkinshaw (Contributor) left a comment

Nice and simple 🎉

@michaelherold left a comment

I like how simple this is. I wonder if the simplicity could potentially hide performance bugs. What do you think?

Comment on lines +235 to +253
def test_prime_will_not_replace_already_cached_value
loader = EchoLoader.for

assert_equal :a, loader.load(:a).sync

loader.prime(:a, :not_a)

assert_equal :a, loader.load(:a).sync
end


comment: This seems like a potential footgun for surprising behavior, particularly with ActiveRecord's lazy loading of associations. What if you try to prime with a model that has preloaded an association you need (N.B. if you're using GraphQL::Batch, you probably shouldn't be loading associations like this, but stranger things have happened), but the promise has already been fulfilled with a version that does not have that association preloaded?

question: Should we emit a warning or something by default when attempting to prime an already-fulfilled promise? One could then opt in to allow either overriding or ignoring the warning.

This could very well be overkill but it's the first thing that came to mind when I saw the implementation.

@tgwizard (Contributor, Author)

Yes, this is indeed not meant to be used to populate the loader with values in some special state for performance reasons. The values you put in should behave the same as the values the loader would have loaded on its own.

I'm not sure there's a good and simple way to protect against this, except in documenting its intended use.

Given this, I don't think we should emit a warning when priming a value for a key that already exists in the loader cache. That is meant to be a no-op (loading and priming should produce equivalent results), so it shouldn't look to the developer like they are doing something wrong. Given how GraphQL resolution works, it can be very hard to reason about or ensure that you'd only be priming keys that don't already exist.

The behaviour here (priming inserts a key in the cache only if the key didn't exist) is also the behaviour of the other data loader libraries I found.

What do you think?

@tgwizard
Copy link
Contributor Author

I updated the prime logic a bit, to deal with the case where you prime a key that's already in the queue:

     def prime(key, value)
-      promise = cache[cache_key(key)] ||= ::Promise.new.tap { |p| p.source = self }
-      promise.fulfill(value) unless promise.fulfilled?
+      cache[cache_key(key)] ||= ::Promise.resolve(value).tap { |p| p.source = self }
     end
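The merged semantics can be illustrated with a minimal plain-Ruby stand-in (these `MiniPromise`/`MiniLoader` classes are a sketch, not the actual graphql-batch implementation): prime inserts an already-resolved promise only when the cache has no entry for the key, so priming an already-loaded or already-primed key is a no-op.

```ruby
# Minimal stand-in for a resolved promise.
class MiniPromise
  attr_reader :value

  def initialize(value)
    @value = value
  end
end

class MiniLoader
  def initialize
    @cache = {}
  end

  # Insert only when the key is absent, mirroring the merged behaviour.
  def prime(key, value)
    @cache[key] = MiniPromise.new(value) unless @cache.key?(key)
  end

  # Echo loader for the sketch: an uncached key resolves to itself.
  def load(key)
    (@cache[key] ||= MiniPromise.new(key)).value
  end
end

loader = MiniLoader.new
puts loader.load(:a)          # a — loaded and cached as an echo
loader.prime(:a, :not_a)      # no-op: :a is already cached
puts loader.load(:a)          # a — the primed value did not replace it
loader.prime(:b, :primed_b)
puts loader.load(:b)          # primed_b — the primed value wins for a new key
```

This matches the `test_prime_will_not_replace_already_cached_value` test above: load-then-prime leaves the original value in place.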

@tgwizard tgwizard merged commit 7aaf3a2 into main Feb 23, 2024
23 checks passed
@tgwizard tgwizard deleted the prime branch February 23, 2024 07:58